Role of Prosody in Automatic Modality Recognition of Bangla Speech

نویسندگان

  • Anal Warsi
  • Tulika Basu
  • Debasis Mazumdar
چکیده

During expressive speech, the voice is enriched to convey not only the intended semantic message but also the speaker’s state of mind and intention. Our goal is to design a tool which can be used in Speech-to-Speech translation system for automatically classifying utterances of Bangla into three modalities namely Statement, Question and Command. Although pitch and intensity features have been commonly used to recognize sentence modality, it is not clear what aspects of the pitch and intensity contour are salient for recognizing sentence modality in Bangla. A set of 30 features derived from 680 speech samples are analyzed to identify the most discriminative set of features for Bangla. Three well-known classification algorithms viz. Decision Tree J48, Support Vector Machine and k-Nearest Neighbor (k-NN) are tested with both the full set and reduced subset of features. A global accuracy of 96.08 % of correct classification has been achieved by k-NN using the reduced subset of features.

منابع مشابه

Separating Words from Continuous Bangla Speech T

In this paper we present a new word separation algorithm for Real Time Speech i.e., Continuous Bangla Speech Recognition (CBSR). Prosody has great impact on Bangla speech and the algorithm is developed by considering prosodic feature with energy. Task of this algorithm is to separate Bangla speech into words. At first continuous Bangla speech are fed into the system and the word separation algo...

متن کامل

Sentence Modality Recognition in French based on Prosody

This paper deals with automatic sentencemodality recognition in French. In this work, only prosodic features are considered. The sentences are recognized according to the three following modalities: declarative, interrogative and exclamatory sentences. This information will be used to animate a talking head for deaf and hearingimpaired children. We first statistically study a real radio corpus ...

متن کامل

Prosodic Modules for Speech Recognition and Understanding in VERBMOBIL

Within VERBMOBIL, a large project on spoken language research in Germany, two modules for detecting and recognizing prosodic events have been developed. One module operates on speech signal parameters and the word hypothesis graph, whereas the other module, designed for a novel, highly interactive architecture, only uses speech signal parameters as its input. Phrase boundaries, sentence modalit...

متن کامل

Formant Analysis of Bangla Vowel for Automatic Speech Recognition

To provide new technological benefits to the mass people, nowadays, regional and local language recognition draws attention to the researchers. Similarly to other languages, Bangla speech recognition scheme is demandable. A formant is considered as the resonance frequency of vocal tract. Formant frequencies play an important role for the purpose of automatic speech recognition, due to its noise...

متن کامل

Subjective Tests and Automatic Sentence Modality Recognition with Recordings of Speech Impaired Children

Prosody recognition experiments have been prepared in the Laboratory of Speech Acoustics, in which, among the others, we were searching for the possibilities of the recognition of sentence modalities. Due to our promising results in the sentence modality recognition, we adopted the method for children modality recognition, and looked for the possibility, how it can be used as an automatic feedb...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

متن کامل
عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012